Reinforcement learning produces dominant strategies for the Iterated Prisoner’s Dilemma
Authors
Abstract
We present tournament results and several powerful strategies for the Iterated Prisoner's Dilemma created using reinforcement learning techniques (evolutionary and particle swarm algorithms). These strategies are trained to perform well against a corpus of over 170 distinct opponents, including many well-known and classic strategies. All the trained strategies win standard tournaments against the full collection of other opponents. The trained strategies and one particular human-designed strategy are also the top performers in noisy tournaments.
Related resources
The Cross Entropy Method for the N-Persons Iterated Prisoner’s Dilemma
We apply the cross-entropy method to the N-person Iterated Prisoner's Dilemma and show that cooperation is more readily achieved than with existing methods such as genetic algorithms or reinforcement learning.
Opponent Modelling and Strategy Evolution in the Iterated Prisoner’s Dilemma
Learning and evolution are two adaptive processes in the natural world that have been modelled in the study of artificial intelligence in computer science. In both biology and in artificial intelligence, learning and evolution are complementary processes. The nature of the interactions between learning and evolution has been the subject of much research in scientific disciplines. Evolution of a...
Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach
The Iterated Prisoner’s Dilemma has guided research on social dilemmas for decades. However, it distinguishes between only two atomic actions: cooperate and defect. In real-world prisoner’s dilemmas, these choices are temporally extended and different strategies may correspond to sequences of actions, reflecting grades of cooperation. We introduce a Sequential Prisoner’s Dilemma (SPD) game to b...
Role of Iterated Prisoner’s Dilemma in Genetic Based Machine Learning
Several strategies have been followed by most earlier researchers in the field of machine learning. Agarwal has connected machine learning with the Iterated Prisoner’s Dilemma (IPD) problem. Holland has proposed basic directions for exploring the goal of genetic operators in the study of machine learning. Axelrod connected genetic algorithms with the IPD. We integrate these basic approaches to give a novel ...
An Experimental Study of N-Person Iterated Prisoner's Dilemma Games
The Iterated Prisoner’s Dilemma game has been used extensively in the study of the evolution of cooperative behaviours in social and biological systems. There have been a lot of experimental studies on evolving strategies for 2-player Iterated Prisoner’s Dilemma games (2IPD). However, there are many real world problems, especially many social and economic ones, which cannot be modelled by the 2...
Journal title:
Volume 12, Issue
Pages -
Publication date: 2017